Selecting Features for Paraphrasing Question Sentences
Authors
Abstract
In this paper we investigate several schemes for selecting features that are useful for automatically classifying questions by their question type. We represent questions as a set of features and compare the performance of the C machine learning algorithm using the different representations. Experimental results show a high accuracy rate in categorizing question types using a scheme based on NLP techniques, as compared to a scheme based on IR techniques. The ultimate goal of this research is to use question type classification to help identify whether or not two questions are paraphrases of each other. We hypothesize that the identification of features which help identify question type will be useful in the generation of question paraphrases as well.
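To make the contrast between the two kinds of representation concrete, the following is a minimal sketch of how a question might be turned into a feature set under an IR-style scheme (a bag of content words) and under an NLP-style scheme (features tied to the interrogative word). The stopword list, the feature names `wh=` and `wh_next=`, and both helper functions are illustrative assumptions, not the feature schemes evaluated in the paper.

```python
import re

# Hypothetical stopword list for the IR-style scheme (not from the paper).
STOPWORDS = {"a", "an", "the", "is", "are", "do", "does", "i", "to", "of", "in"}
# Interrogative words used by the illustrative NLP-style scheme.
WH_WORDS = {"what", "when", "where", "who", "why", "how", "which"}


def tokenize(question):
    """Lowercase the question and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", question.lower())


def ir_features(question):
    """IR-style representation: the set of non-stopword tokens (bag of words)."""
    return {tok for tok in tokenize(question) if tok not in STOPWORDS}


def nlp_features(question):
    """NLP-style representation: the first interrogative word and the token after it."""
    tokens = tokenize(question)
    features = set()
    for i, tok in enumerate(tokens):
        if tok in WH_WORDS:
            features.add("wh=" + tok)
            if i + 1 < len(tokens):
                features.add("wh_next=" + tokens[i + 1])
            break  # only the first interrogative word is used
    return features
```

Either feature set could then be passed to a classifier of one's choice. Under these assumptions, "How long does it take to boil an egg?" and "How long should I boil an egg?" share the same NLP-style features even though their bags of words differ, which is the kind of signal a paraphrase check might exploit.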
Similar resources
Comparison of Different Machine Learning Methods in Extractive Speech-to-Speech Summarization of Persian Without Using Transcripts
In this paper, extractive speech summarization using different machine learning algorithms was investigated. The task of speech summarization deals with extracting important and salient segments from speech so that speech files can be accessed, searched, extracted, and browsed more easily and at lower cost. In this paper, a new method for speech summarization without using automatic speech recognitio...
Lexical Paraphrasing for Document Retrieval and Node Identification
We investigate lexical paraphrasing in the context of two distinct applications: document retrieval and node identification. Document retrieval – the first step in question answering – retrieves documents that contain answers to user queries. Node identification – performed in the context of a Bayesian argumentation system – matches users’ Natural Language sentences to nodes in a Bayesian netwo...
Automatic Expansion of Equivalent Sentence Set Based on Syntactic Substitution
In this paper, we propose an automatic quantitative expansion method for a sentence set that contains sentences of the same meaning (called an equivalent sentence set). This task is regarded as paraphrasing. The features of our method are: 1) The paraphrasing rules are dynamically acquired by Hierarchical Phrase Alignment from the equivalent sentence set, and 2) A large equivalent sentence set ...
Hybrid System Combination for Machine Translation: An Integration of Phrase-level and Sentence-level Combination Approaches
Wei-Yun Ma. Given the wide range of successful statistical MT approaches that have emerged recently, it would be beneficial to take advantage of their individual strengths and avoid their individual weaknesses. Multi-Engine Machine Translation (MEMT) attempts to do so by ei...
The Performance of Iranian EFL Learners in Producing and Recognizing Idiom-Containing Sentences
This study aimed to investigate how Iranian EFL learners performed in producing sentences containing idioms and whether they had any problems in producing such sentences. This query, subsequently, raised the question of whether idioms influenced the participants’ grammaticality judgment on idiom-containing sentences. For this purpose, firstly, the writings of 24 learners were investigated for a...